Abstract

Volatility measures the amplitude of price fluctuations. Although it is one of the most important
quantities in finance, volatility is not directly observable. Here we apply a maximum likelihood
method which assumes that price and volatility follow a two-dimensional diffusion process where
volatility is the stochastic diffusion coefficient of the log-price dynamics. We apply this method
to the simplest versions of the expOU, the OU and the Heston stochastic volatility models and
we study their performance in terms of the log-price probability, the volatility probability, and
its Mean First-Passage Time. The approach has some predictive power on the amplitude of future
returns from knowledge of the current volatility alone. The assumed models consider neither the
long-range volatility auto-correlation nor the asymmetric return-volatility cross-correlation, but
the method still reproduces these two important stylized facts quite naturally. We apply the
method to different market indexes, with good performance in all cases.
I. INTRODUCTION

Volatility is a quantity that aims to capture the amplitude of price return
fluctuations [HE]. It is associated with the risk of holding an asset: the higher the
volatility, the riskier the market price. Investors sometimes pay more attention to volatility
than to the price level or the current trend of a stock. The role of volatility becomes even
more crucial when trading financial derivatives such as options, since the value of volatility
almost fully determines the price of this sort of contract [1, 2]. However, volatility
itself is not directly observed, and financial markets and their actors lack a unique
consensus on how to provide its value.

Therefore, there is no other choice than to infer, in one way or another, the value
of volatility from price time series. In practice, this means that it is necessary first to assume
a model governing the dynamics of financial assets and second to extract the volatility from the
data time series from the perspective of the model dynamics considered.

The physicist Osborne proposed the Geometric Brownian Motion (GBM) model in
1959 [3]. In the GBM, a diffusion process drives the logarithmic price changes with a constant
diffusion coefficient, typically called volatility. In this case, computing the market volatility
means first calculating the standard deviation of the logarithmic price changes over time
periods of length $\Delta t$. The volatility is then the ratio between this standard
deviation and the square root of $\Delta t$, since we are implicitly assuming the GBM diffusion
model.

Further studies of financial data have established that the GBM is very incomplete [2]
and appears unable to explain quite a long list of stylized facts observed in financial
markets [2, 4]. Especially during the last two decades, several models have been proposed
with the aim of capturing (i) the existence of fatter tails in the log-price fluctuations, and (ii)
the presence of non-trivial memory in the market dynamics [2]. A very natural improvement
of the GBM is to consider volatility as a random process following another continuous-time
diffusion process [5-12]. The price and the hidden Markov process for the volatility therefore
form a two-dimensional diffusion process, and the approach belongs to the so-called
stochastic volatility (SV) modeling [13, 14]. The approach is analogous to random diffusion
modeling, which describes the dynamics of particles in random media and is applicable to a large
variety of phenomena in statistical physics and condensed matter [15].
Among the existing SV models (T2HTH [16], the most basic ones are the Ornstein-Uhlenbeck
(OU) model [17-19], the Heston model [20-22], which is in fact a Feller process, and the
exponential Ornstein-Uhlenbeck (expOU) model [23-25]. With the aim of extracting volatility
from financial market data, the current work develops much further the maximum likelihood
(ML) estimation applied to the expOU model in Ref. [26] by one of us. Here we
extend the methodology to the OU and Heston SV models, and we also study some of the
most important statistical features observed in financial markets [2JIHEI2]: the return and
volatility probability density functions (pdf's), the volatility auto-correlation and the leverage
correlation, and the Mean First-Passage Time. To do so, we use eight daily indexes:
Dow Jones Industrial Average (DJI), Standard and Poor's-500 (S&P), German index DAX,
Japanese index NIKKEI, American index NASDAQ, British index FTSE-100, Spanish index
IBEX-35 and French index CAC-40. We also assess the ability of the method to predict
the absolute value of future price returns from today's volatility.

This paper is divided into five sections. In Section II we present the SV models and
their main characteristics, while in Section III we show the ML approach. In Section IV we
provide results obtained from our algorithm. Conclusions are left to Section V.

II. THE STOCHASTIC VOLATILITY MARKET MODELS AND BASIC 
VOLATILITY ESTIMATORS 

The starting point of any SV model is the GBM model [3]

$$\frac{dS(t)}{S(t)} = \mu\,dt + \sigma\,dW_1(t), \tag{1}$$

where $dW_1(t)$ corresponds to a Wiener noise (i.e., a zero mean and unit variance Gaussian
process), $S(t)$ is a financial price or the value of an index, $\mu$ is the drift and $\sigma$ is
the volatility. We define the zero-mean return $X(t)$ as

$$X(t) = \ln\frac{S(t)}{S(t_0)} - \left\langle \ln\frac{S(t)}{S(t_0)} \right\rangle, \tag{2}$$

where $t_0$ is the initial time. Let us note that, since Osborne's work in 1959 [3], $X(t)$ has
been assumed to have independent and stationary increments. We can then
rewrite Eq. (1) as follows

$$dX(t) = \sigma(t)\,dW_1(t). \tag{3}$$

The term $\sigma$ was initially considered to be constant. However, most existing market
models nowadays assume that $\sigma$, also called volatility, is a time-varying variable.

SV models assume that the volatility is a hidden Markov process $\sigma(t) = f(Y(t))$, where
$Y(t)$ obeys a subordinated diffusive stochastic differential equation. Under this perspective,
the two-dimensional dynamics reads [14]

$$dX(t) = f(Y(t))\,dW_1(t), \tag{4}$$
$$dY(t) = -g(Y(t))\,dt + h(Y(t))\,dW_2(t), \tag{5}$$

where $W_i(t)$ ($i = 1, 2$) are Wiener processes that may or may not be independent. As $f(y)$ is
always defined as a monotonically increasing function, $Y(t)$ is sometimes also called volatility.
As shown in Tab. I, each model has its own expressions for $f(y)$, $g(y)$ and $h(y)$. The
models proposed in the literature differ in these functions, but in general there is
a wide consensus to consider processes with a (negative) mean-reverting force that leads the
probability density function of the volatility to a stationary solution when time is sufficiently
large.
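
As an illustration of Eqs. (4)-(5), the following minimal sketch integrates the two-dimensional
dynamics with a simple Euler scheme, using the functions listed in Tab. I. The function name
`simulate`, the Euler discretization, the independence of the two noises and the truncation of
negative $Y$ in the Heston case are assumptions made for this example only; they are not
prescribed by the text.

```python
import math
import random

# f, g, h for each model as listed in Table I (m, alpha, k are the model constants).
MODELS = {
    "expOU": (lambda y, m, a, k: m * math.exp(y),
              lambda y, m, a, k: a * y,
              lambda y, m, a, k: k),
    "OU": (lambda y, m, a, k: y,
           lambda y, m, a, k: a * (y - m),
           lambda y, m, a, k: k),
    "Heston": (lambda y, m, a, k: math.sqrt(y),
               lambda y, m, a, k: a * (y - m),
               lambda y, m, a, k: k * math.sqrt(y)),
}

def simulate(model, m, alpha, k, y0, n_steps, dt=1.0, seed=0):
    """Euler path of Eqs. (4)-(5) with independent Wiener increments (assumption)."""
    f, g, h = MODELS[model]
    rng = random.Random(seed)
    x, y = 0.0, y0
    xs, ys = [x], [y]
    for _ in range(n_steps):
        dw1 = rng.gauss(0.0, math.sqrt(dt))
        dw2 = rng.gauss(0.0, math.sqrt(dt))
        # Both updates use the value of Y at the start of the step.
        x += f(y, m, alpha, k) * dw1
        y += -g(y, m, alpha, k) * dt + h(y, m, alpha, k) * dw2
        if model == "Heston":
            y = max(y, 0.0)  # crude truncation (assumption) so that sqrt(y) stays real
        xs.append(x)
        ys.append(y)
    return xs, ys
```

For instance, `simulate("OU", m=1.2e-2, alpha=5e-2, k=1.4e-3, y0=1.2e-2, n_steps=1000)` returns
one artificial return path and its hidden volatility path in daily units.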

Let us focus on the volatility estimation procedures. As a first approximation, and as
mentioned in the introduction, the volatility can be viewed as the standard deviation of the
empirical daily zero-mean return changes

$$\sigma_{\rm GBM} = \sqrt{\frac{\left\langle \Delta X(t)^2 \right\rangle}{\Delta t}}.$$

As we are considering daily data, we are assuming discrete time increments $\Delta t = 1$ day and
discrete return increments $\Delta X(t) = X(t + 1\,\text{day}) - X(t)$. In such a case, we are
implicitly assuming the GBM provided by Eq. (1) with constant volatility in daily units.
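
A minimal sketch of this first estimator follows. The function name `sigma_gbm` is hypothetical,
and the sample mean of the increments is subtracted as a safeguard (an assumption; for an exactly
zero-mean return series it makes no difference).

```python
import math

def sigma_gbm(x, dt=1.0):
    """Constant-volatility estimate: standard deviation of the increments over sqrt(dt)."""
    dx = [b - a for a, b in zip(x, x[1:])]
    mean = sum(dx) / len(dx)
    var = sum((d - mean) ** 2 for d in dx) / len(dx)
    return math.sqrt(var / dt)
```

With daily data and `dt=1.0` the result is a volatility in daily units.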

As a second level of approximation we allow for time-varying volatility. Observing Eq. (3),
we now define volatility as

$$\sigma_{\rm prop}(t) = \frac{|\Delta X(t)|}{\langle |\Delta W_1(t)| \rangle}, \tag{6}$$

and we have a different volatility for each day. However, the volatility obtained has a
skewed stationary probability density inconsistent with volatility modeling, as discussed in
Refs. [2, 23].
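
As a brief sketch of Eq. (6): for a Gaussian Wiener increment of variance $\Delta t$, the mean
absolute value is $\langle |\Delta W_1| \rangle = \sqrt{2\Delta t/\pi}$, so the proportional
volatility is just a rescaled absolute return. The function name `sigma_prop` is hypothetical.

```python
import math

def sigma_prop(x, dt=1.0):
    """Proportional volatility, Eq. (6): |dX(t)| divided by the mean of |dW|."""
    mean_abs_dw = math.sqrt(2.0 * dt / math.pi)  # <|dW|> for a Gaussian of variance dt
    return [abs(b - a) / mean_abs_dw for a, b in zip(x, x[1:])]
```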

TABLE I: Volatility expressions in terms of $f(y)$, $g(y)$ and $h(y)$ appearing in Eqs. (4)-(5).
These models have three constants: the normal level of volatility $m$, the driving force $\alpha$
that drives volatility to $m$, and the amplitude of volatility fluctuations $k$, often called
volatility-of-volatility [14].

          expOU          OU                  Heston
f(y)      $m e^{y}$      $y$                 $y^{1/2}$
g(y)      $\alpha y$     $\alpha (y - m)$    $\alpha (y - m)$
h(y)      $k$            $k$                 $k y^{1/2}$

A third possibility is to compute a deconvoluted volatility

$$\sigma_{\rm decon}(t) = \left| \frac{\Delta X(t)}{\Delta W_1(t)} \right|, \tag{7}$$

which does not show a skewed probability density for the volatility. Its greatest drawback
is that the estimated volatility appears to be a very noisy signal (see for instance Refs. [2, 23]
and [23 EE] for alternative approaches and further discussions).
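
The noise of the deconvoluted estimator can be seen directly in a short sketch: each absolute
return is divided by one freshly drawn Gaussian number, so a draw close to zero produces a very
large volatility value. The function name `sigma_decon` and the use of the absolute value are
assumptions of this example.

```python
import math
import random

def sigma_decon(x, dt=1.0, seed=0):
    """Deconvoluted volatility, Eq. (7): one random Wiener increment per return."""
    rng = random.Random(seed)
    out = []
    for a, b in zip(x, x[1:]):
        dw = rng.gauss(0.0, math.sqrt(dt))  # single random realization of dW_1
        out.append(abs((b - a) / dw))
    return out
```

Even for a return series with constant $|\Delta X|$, the output spans several orders of magnitude.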

III. MAXIMUM LIKELIHOOD APPROACH 

We here briefly present the methodology proposed in Ref. [26], which provides a
criterion for choosing the best values of the random realization $\Delta W_1$. Naively speaking, the
method represents an improvement of the deconvoluted volatility estimator $\sigma_{\rm decon}$ using a
ML methodology.

To explain the procedure it is more convenient to work with the discrete time version of
the model. To this end, suppose that $\Delta t$ is a small time step and that the driving noises in
Eqs. (4)-(5) can be approximated by

$$dW_i(t) \approx \varepsilon_i(t)\sqrt{\Delta t}, \qquad (i = 1, 2), \tag{8}$$

where $\varepsilon_i(t)$ are independent standard Gaussian processes with zero mean and unit variance.
The discrete time equations of the model describing increments of $X(t)$ and $Y(t)$ thus read

$$\Delta X(t) = f(Y(t))\,\varepsilon_1(t)\sqrt{\Delta t}, \tag{9}$$
$$\Delta Y(t) = -g(Y(t))\,\Delta t + h(Y(t))\,\varepsilon_2(t)\sqrt{\Delta t}, \tag{10}$$

where $\Delta X(t) = X(t + \Delta t) - X(t)$ and $\Delta Y(t) = Y(t + \Delta t) - Y(t)$. From
Eqs. (9)-(10), we can get

$$\varepsilon_1(t) = \frac{\Delta X(t)}{f(Y(t))\sqrt{\Delta t}}, \tag{11}$$
$$\varepsilon_2(t) = \frac{\Delta Y(t) + g(Y(t))\,\Delta t}{h(Y(t))\sqrt{\Delta t}}. \tag{12}$$

For simplicity we assume that $\varepsilon_1$ and $\varepsilon_2$ are independent standard Gaussians.
We will discuss in Section IV D that our methodology does not need to consider a negative
cross-correlation between these two Gaussian variables, as is done for instance in the models
studied in Refs. pJEHiEH]. Hence,

$$P(\varepsilon_1, \varepsilon_2) = \frac{1}{2\pi} \exp\left[ -\frac{\varepsilon_1^2 + \varepsilon_2^2}{2} \right],$$

and this can finally be transformed into the conditional probability density function (pdf)

$$P(X(\tau), Y(\tau) \,|\, X(\tau - \Delta t), Y(\tau - \Delta t)) = \frac{1/(2\pi\Delta t)}{f(Y(\tau - \Delta t))\,h(Y(\tau - \Delta t))} \exp\left[ -\frac{\varepsilon_1^2(\tau - \Delta t) + \varepsilon_2^2(\tau - \Delta t)}{2} \right] \tag{13}$$

by including the Jacobian of the transformation
$(X(\tau), Y(\tau)) \to (\varepsilon_1(\tau - \Delta t), \varepsilon_2(\tau - \Delta t))$
defined by Eqs. (9)-(10).

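For a given number of realizations, the probability of the set $\{X, Y\}$ for the period
$(\tau = t, t - \Delta t, \ldots, t - s)$ can be easily obtained. The Markov property of the process
ensures that one can decompose the joint pdf of this set as a chain of products of conditional
pdf's

$$P(\{X, Y\}) = P(X(t - s), Y(t - s)) \times \prod_{\tau = t + \Delta t - s}^{t} P(X(\tau), Y(\tau) \,|\, X(\tau - \Delta t), Y(\tau - \Delta t)). \tag{14}$$

Substituting Eqs. (11)-(12) into Eq. (13) and inserting the result into Eq. (14), we apply the
chain of products of conditional pdf's and we finally get the joint pdf

$$\begin{aligned}
\ln P(\{X, Y\}) = {} & -\frac{s}{\Delta t}\ln(2\pi\Delta t)
 - \sum_{\tau = t + \Delta t - s}^{t} \left[ \ln f(Y(\tau - \Delta t)) + \ln h(Y(\tau - \Delta t)) \right]
 + \ln P(X(t - s), Y(t - s)) \\
 & - \frac{1}{2} \sum_{\tau = t + \Delta t - s}^{t} \left[ \frac{X(\tau) - X(\tau - \Delta t)}{f(Y(\tau - \Delta t))\,\Delta t} \right]^2 \Delta t \\
 & - \frac{1}{2} \sum_{\tau = t + \Delta t - s}^{t} \left[ \frac{Y(\tau) - Y(\tau - \Delta t)}{h(Y(\tau - \Delta t))\,\Delta t} + \frac{g(Y(\tau - \Delta t))}{h(Y(\tau - \Delta t))} \right]^2 \Delta t.
\end{aligned} \tag{15}$$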

We recall that our aim is to find a proper realization of the volatility $Y$ given a return
$X$, and this will be done by applying a ML procedure to the variable $Y$. For this reason, we
can omit three terms in Eq. (15). The first summand comes from the normalization
constant of the Gaussian distribution (13). It appears in every conditional probability
density, and this is the reason for the factor $s/\Delta t$, which is the number of time steps between
$t - s$ and $t$. The resulting term does not depend on the realization, so we can neglect it
in a maximization with respect to the set of realizations $Y$. The second summand is
the sum of the Jacobian contributions of each transition probability. Stochastic volatility
models assume that $f$ and $h$ are continuous and monotonically increasing functions, or
even constants. Because of this, we can also neglect this term in the maximization procedure.
The term $\ln P(X(t - s), Y(t - s))$ is fixed by the initial conditions of the process. We could
here assume a known initial return $X_0$, which can be set to zero, and take a random $Y(t - s)$
following its stationary distribution. We would therefore have
$P(X(t - s), Y(t - s)) = \delta(X(t - s) - X_0)\,P_{\rm st}(Y(t - s))$. Had we taken another initial
condition, the technique would have given equivalent results (we have checked this by using
several initial distributions). For this reason, and in order to improve the convergence of the
ML estimate, we have also neglected this contribution.
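We can therefore write

$$\ln P(X, Y) \simeq -\frac{1}{2} \sum_{\tau = t + \Delta t - s}^{t} \left[ \frac{\Delta X(\tau - \Delta t)}{f(Y(\tau - \Delta t))\,\Delta t} \right]^2 \Delta t
 - \frac{1}{2} \sum_{\tau = t + \Delta t - s}^{t} \left[ \frac{\Delta Y(\tau - \Delta t)}{h(Y(\tau - \Delta t))\,\Delta t} + \frac{g(Y(\tau - \Delta t))}{h(Y(\tau - \Delta t))} \right]^2 \Delta t, \tag{16}$$

and omit the other three terms for the reasons summarized above (cf. Eq. (15)). Further
details can be found in Ref. [26].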



Let us finally briefly provide an interpretation of the two remaining terms in Eq. (16).
The first term of Eq. (16) measures the return variations relative to the volatility. The
higher these fluctuations are, the lower the contribution to the probability is.
The second term measures the fluctuations of the volatility relative to the volatility of
the volatility. Again, the bigger this term, the lower the contribution.
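
The two terms of Eq. (16) can be evaluated directly on a pair of discrete paths. The sketch
below is only illustrative: the function name `log_likelihood` and the convention that `xs` and
`ys` are sampled on the same time grid are assumptions, and `f`, `g`, `h` are single-argument
callables for the chosen model.

```python
def log_likelihood(xs, ys, f, g, h, dt=1.0):
    """Approximate log-probability of Eq. (16) for paths X and Y on a common grid."""
    total = 0.0
    for i in range(1, len(xs)):
        y_prev = ys[i - 1]
        # First term: return increment relative to the volatility f(Y).
        term_x = (xs[i] - xs[i - 1]) / (f(y_prev) * dt)
        # Second term: volatility increment relative to the vol-of-vol h(Y), plus drift.
        term_y = (ys[i] - y_prev) / (h(y_prev) * dt) + g(y_prev) / h(y_prev)
        total -= 0.5 * (term_x ** 2 + term_y ** 2) * dt
    return total
```

A flat return path combined with a volatility path sitting at its normal level gives the
maximum value, zero; any fluctuation in either path lowers the result.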
FIG. 1: A comparison between different volatilities as a function of time. On the top we observe the
deconvoluted volatility $\sigma_{\rm decon}$ computed using Eq. (7). The other plots show estimated
volatilities $\sigma_{\rm est}$ calculated for the three different models as explained in Section III A
and Eq. (19).

A. The Algorithm 

As mentioned above, our goal is to find a proper realization of the volatility series $Y$
given the return series $X$, which is directly observed and taken from empirical data. We
should therefore consider the following conditional probability of a single event

$$\ln P(Y \,|\, X) = \ln P(X, Y) - \ln P(X). \tag{17}$$

As we solely want to maximize this probability for a fixed set of returns forming a
path, the second term can be neglected, and therefore maximizing Eq. (17) is equivalent to
maximizing Eq. (16). In practice, the method therefore computes different realizations of
the volatility variable for a given return path, and ML estimation dictates that we should take
the realization that maximizes the probability given by Eq. (16). The method filters the
Wiener noise $\Delta W_1(t)$ and lets us obtain an estimate $Y_{\rm est}(t)$ of the hidden volatility for a
given price return evolution.

Specifically, we have implemented an algorithm which sequentially follows four steps:

1. Looking at Eq. (7), we generate a simple realization of $Y$ by taking

$$Y_{\rm est}(\tau) = f^{-1}\!\left( \left| \frac{\Delta X(\tau)}{\Delta W_1(\tau)} \right| \right), \tag{18}$$

where $t - s \le \tau \le t$, with $\Delta X(\tau) = X(\tau + \Delta t) - X(\tau)$ taken from data, and
$\Delta W_1(\tau)$ being a zero mean and unit variance Gaussian realization.

2. We substitute $Y_{\rm est}$ and $X$ into Eq. (16) and we then compute the probability.

3. We iterate steps 1 and 2 $I$ times. We finally keep the realization that gives the
highest probability in Eq. (16) and define it as $Y_{\rm est}(t)$.

4. Finally, the estimator of the volatility at time $t$ is

$$\sigma_{\rm est}(t) = f(Y_{\rm est}(t)). \tag{19}$$

We observe that this procedure depends on $I$ and $s$. We have implemented the algorithm
with $s = 10$ and $I = 100{,}000$. We have used these values because a larger time window $s$ and
a larger number $I$ of iterations do not improve the quality of our estimation.
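
The four steps above can be sketched as follows. This is a minimal illustration under stated
assumptions, not the authors' implementation: the function name `ml_volatility`, its arguments
and the handling of the window are hypothetical, the increments in `dx` are assumed to be
non-zero, and `f`, `f_inv`, `g`, `h` are single-argument callables for the chosen model
(`f_inv` being the inverse of `f`).

```python
import math
import random

def ml_volatility(dx, f, f_inv, g, h, n_iter=1000, dt=1.0, seed=0):
    """Return (sigma_est, best log-probability) for one window of return increments dx."""
    rng = random.Random(seed)
    best_logp, best_ys = -math.inf, None
    for _ in range(n_iter):
        # Step 1: trial volatility path from random Wiener increments, Eq. (18).
        ys = [f_inv(abs(d / rng.gauss(0.0, math.sqrt(dt)))) for d in dx]
        # Step 2: log-probability of the trial path, Eq. (16).
        logp = 0.0
        for j in range(1, len(ys)):
            y0 = ys[j - 1]
            e1 = dx[j - 1] / (f(y0) * dt)
            e2 = (ys[j] - y0) / (h(y0) * dt) + g(y0) / h(y0)
            logp -= 0.5 * (e1 * e1 + e2 * e2) * dt
        # Step 3: keep the most probable realization found so far.
        if logp > best_logp:
            best_logp, best_ys = logp, ys
    # Step 4: volatility estimate at the end of the window, Eq. (19).
    return f(best_ys[-1]), best_logp
```

In this sketch `len(dx)` plays the role of the window $s$ and `n_iter` the role of $I$; the
default `n_iter=1000` is chosen only to keep the example fast.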

We observe that $\sigma_{\rm decon}$ (cf. Eq. (7)) is calculated with a single random value
$\Delta W_1$, while $\sigma_{\rm est}$ chooses an optimal value after $I$ iterations. As observed in
Fig. 1 for Dow Jones daily data from October 1928 to July 2011, $\sigma_{\rm est}$ is less noisy
than $\sigma_{\rm decon}$ in all studied models. The fluctuations of the deconvoluted volatility are
three or four orders of magnitude larger than those of the three ML estimates proposed herein.

We also stress that the SV model and its parameters are chosen before
starting the computation. The parameters can however be easily estimated beforehand
using historical data J2E]. See for instance Refs. [30-38] for alternative procedures for
reconstructing volatility, which are more or less dependent on the volatility model chosen. Some
of these approaches also include the parameter estimation procedure within the volatility
estimation. Others are mainly devoted to capturing the long-term memory of the volatility.

IV. RESULTS AND COMPARISON BETWEEN MODELS 

We here study the probability density of the volatility, the conditional return, the Mean
First-Passage Time (MFPT) and the two most important time correlations (the volatility
auto-correlation and the asymmetric return-volatility correlation, or leverage effect) for the
three different SV models. The data used for the comparisons across models in
Sections IV A to IV D are the Dow Jones daily data from October 1928
to July 2011, while Section IV E extends the survey to other financial market indices.



TABLE II: Parameters, measured in daily units, for the three SV Models.

          k                        $\alpha$                 m
OU        $1.4 \times 10^{-3}$     $5 \times 10^{-2}$       $1.2 \times 10^{-2}$
Heston    $2.45 \times 10^{-3}$    $4.5 \times 10^{-2}$     $8.62 \times 10^{-5}$
ExpOU     $4.7 \times 10^{-2}$     $1.82 \times 10^{-3}$    $8 \times 10^{-3}$

FIG. 2: Probability distribution of the different volatilities in semi-log scale: $\sigma_{\rm decon}$
(cf. Eq. (7)) for the Dow Jones data and $\sigma_{\rm est}$ for the expOU, the OU and the Heston cases
(cf. Eqs. (16), (19) and Table II). We also include the theoretical stationary pdf for each model.
The expOU model seems to be the one that best agrees with its theoretical pdf.



The parameters we use for the numerical calculations are those given in the literature to
reproduce the DJI (TSJ [2TJ [23] and they are summarized in Tab. II.



A. Behavior of our estimator 



In order to compare how our algorithm works on each model, we have first calculated
the probability distribution of the different volatilities. For the sake of completeness we
represent the stationary volatility probability density function (pdf) in Fig. 2, thus showing,
as expected, that the form of the curves depends on the model choice. It should be noticed
that we have used the absolute value of the volatility in the case of the OU model throughout
the paper. Figure 2 also shows that the best agreement between the theoretical curve and the
empirical data points corresponds to the expOU case. Several studies in the literature have
measured the stationary volatility pdf [2j |23j [27J I2H] and all of them suggest an exponential
decay corresponding to a log-normal curve [231 12H] or an inverse gamma distribution [2], at
least with low frequency data. It should however be noted that a very recent model with a
two-dimensional diffusion process succeeds in providing an inverse gamma distribution [12],
and it would indeed be interesting to apply the methodology to this new model.

[Figure 3; horizontal axis: daily return differences]

FIG. 3: Comparison between the probability densities of the return differences $\Delta X$ calculated
using Eq. (3). We observe that the expOU model is the one that provides the worst agreement with
empirical data, probably because of its high sensitivity to the parameter calibration.

We also compute artificial return fluctuations $\Delta X(t)$ for each model by multiplying
$\sigma_{\rm est}(t)$ by a Wiener noise realization, as given by Eq. (3). In this way, we can compare
the daily zero-mean return pdf of the three SV models with the empirical data of daily returns
$\Delta X(t)$. In Fig. 3 we observe that the peak of the empirical $\Delta X(t)$ is not reproduced
by any model, whereas the tails of the empirical data are similar to those obtained with
all models. The differences among the models can be explained by the fact that
the parameter estimation in each model has not been systematically optimized. We observe
that the expOU model is the one that provides the worst agreement with empirical data. A
possible reason is that the expOU model has a multiplicative
relation with the underlying random process $Y$, with $\sigma = f(Y) = m\exp(Y)$, and therefore
needs a really accurate calibration (cf. Tab. I).

FIG. 4: Logarithm of the median of the empirical return differences as a function of the logarithm
of the estimated volatility (19) for the three different models. All the curves are shifted for
clarity. In brackets, we give the value of the slope of the linear regression. The points
represent the medians and the error bars are the first and third quartiles in the bins.



B. Predictive power of the method 

This section looks for some inferred behavior in the absolute value of future zero-mean
returns based on the estimate of the current value of volatility. We first consider the logarithm
of Eq. (9),

$$\ln |\Delta X(t)| = \ln \sigma(t) + \ln |\Delta W_1(t)|, \tag{20}$$

and we can now obtain the conditional median of the empirical
$\ln |\Delta X(t)| = \ln |X(t + 1\,\text{day}) - X(t)|$ given that we know $\ln \sigma(t)$ through our
ML method. In such a case, we should have the following linear regression for the conditional median

$$M\left[ \ln |\Delta X(t)| \,\big|\, \ln \sigma(t) \right] = \ln \sigma(t) + {\rm ct}, \tag{21}$$

where ct is a constant. In Fig. 4 we plot this relationship for the three different models.
We observe, however, that the slopes are not equal to 1. In this sense, the Heston and OU models
have the best performance, although we should take into account that the performance might
be very sensitive to the efficiency of the parameter estimation procedure.
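
The regression of Eq. (21) can be sketched as follows: the pairs $(\ln\sigma, \ln|\Delta X|)$ are
sorted by volatility, grouped into bins, and a straight line is fitted to the bin medians. The
function name `median_slope` and the use of equal-population bins are assumptions of this
example; the text does not specify how the bins were built.

```python
import math
import statistics

def median_slope(sigma, abs_dx, n_bins=10):
    """Least-squares slope and intercept of median ln|dX| versus median ln(sigma) per bin."""
    pairs = sorted((math.log(s), math.log(a))
                   for s, a in zip(sigma, abs_dx) if s > 0 and a > 0)
    size = len(pairs) // n_bins  # equal-population bins (assumption)
    pts = []
    for b in range(n_bins):
        chunk = pairs[b * size:(b + 1) * size]
        pts.append((statistics.median(p[0] for p in chunk),
                    statistics.median(p[1] for p in chunk)))
    mx = sum(p[0] for p in pts) / len(pts)
    my = sum(p[1] for p in pts) / len(pts)
    sxx = sum((p[0] - mx) ** 2 for p in pts)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in pts)
    slope = sxy / sxx
    return slope, my - slope * mx
```

A slope equal to 1 corresponds to the ideal case of Eq. (21); the intercept plays the role of
the constant ct.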

FIG. 5: Representation of the magnitude $\gamma(h)$ that appears in Eq. (22). The error bars correspond
to the error on the slope of the regression of Fig. 4. The data have been divided into two regimes in
the case of the expOU and the OU models. All the plots are also shifted for the sake of readability.

TABLE III: Experimental values of the coefficients of Eq. (23). The expOU and the OU models
show a double time scale while the Heston model has a single time scale. Number 1 is valid for
$h < 7$ while number 2 applies for $h > 7$.

      expOU 1    expOU 2    OU 1     OU 2      Heston
a     -0.12      -0.064     -0.15    -0.064    -0.048
b     0.82       0.72       0.85     0.67      0.63



In any case, we still have a linear regression measuring how large today's price
fluctuation is going to be based on yesterday's volatility level. One can go one step further and
use the observed relationship between price fluctuations and volatility to forecast the amplitude
of price changes at a longer time $t + h$ based on the volatility at time $t$. A reasonable
modification of the conditional median given by Eq. (21) is

$$M\left[ \ln |\Delta X(t + h)| \,\big|\, \ln \sigma(t) \right] = \gamma(h) \ln \sigma(t) + {\rm ct}, \tag{22}$$

which was already proposed in Ref. but solely applied to the expOU case. We here
therefore calculate $\gamma(h)$ in terms of the time horizon $h$ for the expOU, the OU and the Heston
cases. Figure 5 shows a linear relation between $\gamma(h)$ and $\ln(h)$ for the three cases. We
therefore propose the heuristic formula

$$M\left[ \ln |\Delta X(t + h)| \,\big|\, \ln \sigma(t) \right] = (a \ln h + b) \ln \sigma(t) + {\rm ct}, \tag{23}$$

where $a$ and $b$ are the coefficients of the regression. Table III shows the empirical values of
the regression. Since we also observe a distinct behavior between short and long time horizons
in the cases of the expOU and the OU models, we provide two different sets of regression
parameters $a$ and $b$ for them.
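
To make Eq. (23) concrete, the sketch below evaluates $\gamma(h) = a \ln h + b$ with the
coefficients reported in Table III. The function names are hypothetical, the constant `ct` is
not reported in the text and is left as a free argument, and assigning $h = 7$ to the second
regime is an arbitrary choice of this example.

```python
import math

# Regression coefficients (a, b) as reported in Table III.
TABLE_III = {
    ("expOU", 1): (-0.12, 0.82), ("expOU", 2): (-0.064, 0.72),
    ("OU", 1): (-0.15, 0.85), ("OU", 2): (-0.064, 0.67),
    ("Heston", None): (-0.048, 0.63),
}

def gamma(model, h):
    """gamma(h) = a ln h + b; regime 1 for h < 7, regime 2 otherwise (Heston: one regime)."""
    if model == "Heston":
        a, b = TABLE_III[("Heston", None)]
    else:
        a, b = TABLE_III[(model, 1 if h < 7 else 2)]
    return a * math.log(h) + b

def median_log_abs_return(model, h, sigma_t, ct=0.0):
    """Right-hand side of Eq. (23); ct is a placeholder for the unreported constant."""
    return gamma(model, h) * math.log(sigma_t) + ct
```

Since $a$ is negative in all cases, $\gamma(h)$ decreases with the horizon, so the current
volatility carries less information about more distant returns.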



C. Mean First-Passage Time 

First-passage and extreme value studies have a long tradition of applications to physics,
biology, chemistry, and engineering, all of them related to non-equilibrium processes. This
sort of event also appears to be important in the context of financial markets as a valuable tool
to calibrate risk in a more sophisticated manner than just providing the standard deviation.
It also represents an alternative and, in a way, improved method [39] to the so-called
Value at Risk [40]. First-passage and other extreme value problems have already been analytically
and empirically studied from the perspective of the SV modeling presented here [39j EJ l4"2].
In this section, we focus on the Mean First-Passage Time (MFPT) of the volatility, which
provides the average time taken by price fluctuations $|\Delta X|$ to cross a certain value
$\lambda$. See Ref. jH] for further theoretical input concerning the MFPT and the SV models
studied herein.

We here want to extend the analysis with the use of our ML method instead of simply
taking the absolute value of price returns as was done in Ref. [41]. In order to compare
different models, we have to work with the dimensionless magnitude $L = \lambda/\sigma_s$, where
$\sigma_s = \langle \sigma(t) \rangle_s$ is the mean of the estimated volatility in the stationary limit
($t \to \infty$). The expected stationary volatility [41] for the expOU model is

$$\sigma_s = m \exp\left( k^2/4\alpha \right),$$

for the OU model is $\sigma_s = m$, and for the Heston model is

kY(2am 2 /k 2 + 1/2) 

FIG. 6: MFPT of the return differences calculated using Eq. (3). The estimated volatilities (19)
of the expOU, the OU and the Heston models are compared with the Dow Jones $|\Delta X|$ data.



TABLE IV: Scaling exponents $\beta$ of the MFPT of real data $\Delta X$ and artificial data of
Heston, OU and expOU models. All the curves in Fig. 6 have a characteristic exponent for $L < 1$
and another for $L > 1$.

          expOU    OU     Heston    DJI data
L < 1     1.1      0.8    1.0       1.3
L > 1     2.4      3.1    2.9       2.9



The MFPT of the three models is computed with their own volatility estimate multiplied
by an artificial Wiener random realization $\Delta W_1$. Figure 6 compares the different
results, which agree qualitatively with the empirical data in all three cases. The expOU case
appears to be the closest to the empirical MFPT curve. Figure 6 shows that the empirical
MFPT results and the three artificial ones can all be roughly described by

$${\rm MFPT}(L) \sim c\,L^{\beta}, \tag{24}$$

with an exponent and a coefficient that change depending on whether $L < 1$ or $L > 1$, as also
shown in Ref. EH. Their values are shown in Tab. IV.
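
A minimal sketch of how an empirical MFPT and the exponent $\beta$ of Eq. (24) could be obtained
from a series of absolute returns is given below. The convention used here (the waiting time is
measured from every starting day to the next day on which the level is reached, and starting
days with no later crossing are discarded) is an assumption of this example, as are the function
names.

```python
import math

def mean_first_passage_time(abs_dx, level):
    """Average waiting time, in time steps, until abs_dx first reaches `level`."""
    next_hit = None
    times = []
    for i in range(len(abs_dx) - 1, -1, -1):  # scan backwards in time
        if abs_dx[i] >= level:
            next_hit = i
        if next_hit is not None:
            times.append(next_hit - i)
    return sum(times) / len(times) if times else math.inf

def scaling_exponent(levels, mfpts):
    """Least-squares slope of ln(MFPT) versus ln(L), i.e. beta in MFPT ~ c L^beta."""
    lx = [math.log(v) for v in levels]
    ly = [math.log(v) for v in mfpts]
    mx, my = sum(lx) / len(lx), sum(ly) / len(ly)
    sxy = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    return sxy / sum((a - mx) ** 2 for a in lx)
```

To work with the dimensionless level $L$, the thresholds can be passed as `L * sigma_s`, and the
exponent can then be fitted separately on the ranges $L < 1$ and $L > 1$.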

FIG. 7: Comparison between the autocorrelation of our volatilities jointly with the autocorrelation
of the proportional $\sigma_{\rm prop}$ provided by Eq. (6).

D. Correlations 

We now study how the ML approach preserves the main market time correlations that deeply
and non-trivially involve volatility dynamics [2]. It is well known that volatility
fluctuations have long-memory correlations (over a year) and that volatility also shows a negative
and asymmetric cross-correlation with return changes (over several weeks), i.e. the leverage
effect [2]. However, it is not clear a priori whether the proposed method is able to provide
these two different correlations.

Figure 7 shows how the volatility autocorrelation

$$C(\tau) = \frac{\langle \sigma(t)\,\sigma(t + \tau) \rangle - \langle \sigma(t) \rangle^2}{{\rm Var}[\sigma(t)]} \tag{25}$$

of each estimator is still significant for up to hundreds of days. It is important to stress
that the OU and Heston models by themselves do not have this long-range correlation, since
their mathematical expressions give an exponential decay for the volatility $\sigma$ in terms of a
characteristic time scale $1/\alpha$ (see Ref. [TU] and Tab. I for the meaning of this parameter).
The expOU model is the only one that explains this long-range effect with a cascade of
exponentials [23]. Therefore, it can be said that the long-term memory is preserved due
to the ML algorithmic method proposed herein. This feature shows the robustness and
effectiveness of the proposed method beyond the choice of the SV model used.
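
For reference, a sample version of the normalized autocorrelation of a volatility series can be
sketched as below. The function name `autocorrelation` and the choice of a single global mean
and variance for the whole series are assumptions of this example.

```python
def autocorrelation(sigma, lag):
    """Sample autocorrelation of a volatility series at a given non-negative lag."""
    n = len(sigma) - lag
    mean = sum(sigma) / len(sigma)
    var = sum((s - mean) ** 2 for s in sigma) / len(sigma)
    cov = sum((sigma[i] - mean) * (sigma[i + lag] - mean) for i in range(n)) / n
    return cov / var
```

Evaluating it for lags from one day up to several hundred days gives the kind of curve that is
compared across estimators.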

We now focus on another important correlation with time. The so-called leverage effect [2],
defined by

$$\mathcal{L}(\tau) = \frac{\langle \Delta X(t)\,\sigma(t + \tau)^2 \rangle}{\langle \sigma(t)^2 \rangle^2}, \tag{26}$$

measures the negative cross-correlation between price return fluctuations and volatility.
Reference [29] shows that the three models are able to mathematically describe the empirical
observation only if a non-zero and negative cross-correlation between $\Delta W_1$ and $\Delta W_2$ is
considered (cf. Eq. (5)). Figure 8 shows the leverage correlation obtained by first computing the
estimated volatility $\sigma_{\rm est}$ (16) and afterwards computing the artificial return change
$\Delta X$ by multiplying the estimated volatility by random realizations of $\Delta W_1$. We recall
that the current ML algorithmic method has not considered this correlation. However, the iterative
procedure of the ML method is able to naturally provide the leverage effect in the three models,
as shown. It is important to stress that we do not need to make our models more sophisticated by
including the cross-correlation coefficient between $\Delta W_1$ and $\Delta W_2$, since the ML
procedure itself naturally includes the negative correlation between these random sources. Adding
the effect of correlation between $\Delta W_1$ and $\Delta W_2$ means adding more terms in Eq. (16)
and makes the ML approach much less efficient in computational terms. The addition of this extra
term would in any case provide redundant information to the maximization process.

Figure 9 shows the leverage correlation of the Heston model as an illustrative example.
It compares the ML approach with other ways of extracting volatility from data. Figure 9
demonstrates that the ML approach gives the same results as $\sigma_{\rm prop}$ given by Eq. (6), but it
also shows how we lose the correlation if we take the deconvoluted $\sigma_{\rm decon}$ given by Eq. (7).
Again, the result can be considered as evidence that our methodology is coherent and
self-consistent. The other two models show very similar results, as can be seen in Fig. 8.



E. Different market indexes 



We have studied how our ML approach performs with different SV models, and we would also
like to verify whether there is any difference between working with one stock market or another.
Concretely, we have computed our estimate of the volatility for the following indexes: Dow
Jones Industrial Average (DJI) (1928-2011), Standard and Poor's-500 (S&P) (1950-2011),
German index DAX (1990-2011), Japanese index NIKKEI (1984-2011), American index
NASDAQ (1985-2011), British index FTSE-100 (1984-2011), Spanish index IBEX-35 (1993-2011)
and French index CAC-40 (1990-2011). It is also important to stress that the parameters
used in each model are the ones from the Dow Jones data provided in Tab. II, so in some
sense there is now no overfitting due to extracting the parameters from the same
data series we are analyzing. In all cases, the resulting $\Delta X$ time series satisfies the
stylized facts that most financial markets have in common [2, 4].

FIG. 8: Leverage correlations (26) of the expOU, the OU and the Heston models. Volatilities
are calculated using the ML method (19) and $|\Delta X|$ is artificially computed by combining
Gaussian realizations of $\Delta W_1$ with the estimated volatility.

FIG. 9: Leverage correlations (26) of the ML estimated volatility (19) for the Heston model
compared with the deconvoluted procedure (7) and the proportional volatility (6).

We first observe that all markets show an estimated volatility considerably less noisy
than the deconvoluted one (cf. Eqs. (19) and (m)). The reduction of the oscillations can be
quantified by the coefficient

$$\frac{\mathrm{Var}(\sigma_{\rm est})}{\mathrm{Var}(\sigma_{\rm decon})}, \qquad (27)$$

whose order of magnitude depends on the stock data, as shown in Tab. V.

TABLE V: Values of the coefficient $\mathrm{Var}(\sigma_{\rm est})/\mathrm{Var}(\sigma_{\rm decon})$ for all the indexes. We show the values
calculated using the expOU, the OU and the Heston models.

Index       expOU          OU             Heston
DJI         8.6 x 10^-7    5.0 x 10^-7    2.5 x 10^-7
S&P         3.0 x 10^-5    1.7 x 10^-5    6.3 x 10^-6
DAX         7.3 x 10^-7    3.4 x 10^-7    1.2 x 10^-7
NIKKEI      2.5 x 10^-6    1.8 x 10^-6    7.2 x 10^-7
NASDAQ      6.8 x 10^-6    4.9 x 10^-6    2.0 x 10^-6
FTSE-100    3.7 x 10^-7    2.6 x 10^-7    8.4 x 10^-8
IBEX-35     2.2 x 10^-4    1.9 x 10^-4    5.3 x 10^-4
CAC-40      3.4 x 10^-6    2.1 x 10^-6    9.0 x 10^-7
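As an illustration of how the variance coefficient can be evaluated, the following minimal Python sketch computes the ratio for two volatility series of equal length. The function name `variance_ratio` and the synthetic series are assumptions made for this example only; they are not market data and do not reproduce the values of Tab. V.

```python
import numpy as np


def variance_ratio(sigma_est, sigma_decon):
    """Return Var(sigma_est) / Var(sigma_decon) for two equally long series."""
    sigma_est = np.asarray(sigma_est, dtype=float)
    sigma_decon = np.asarray(sigma_decon, dtype=float)
    if sigma_est.shape != sigma_decon.shape:
        raise ValueError("both volatility series must have the same length")
    # Population variances (ddof=0) of both series
    return np.var(sigma_est) / np.var(sigma_decon)


# Toy illustration with synthetic series (hypothetical, not market data):
# a smooth volatility path and the same path contaminated by strong noise.
rng = np.random.default_rng(0)
smooth = 0.01 + 0.001 * np.sin(np.linspace(0.0, 20.0, 5000))
noisy = smooth + 0.01 * rng.standard_normal(smooth.size)
ratio = variance_ratio(smooth, noisy)
print(f"variance ratio of the toy series: {ratio:.3e}")
```

A ratio well below one indicates that the first series fluctuates much less than the second, which is the sense in which the coefficient quantifies the noise reduction.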



In Fig. 10, we plot the volatility pdf given by the Heston model for two different indexes.
We notice the different widths of the probability distributions of the two indexes, because each
market has a different range of volatility values. We can again appreciate the reduction of
the fluctuations achieved with our estimated volatility when compared with the deconvoluted
volatility Q.

Figure 11 shows the probability distribution of the artificially computed return differences
with the estimated volatility of each stock index. In this case, we have used the expOU model. As
expected, the width of the curves depends on the stock market, but the behavior
is qualitatively similar. The same occurs with the Heston and OU models.
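The construction of artificial return differences from an estimated volatility can be sketched as follows. This is only an assumption-laden illustration: it takes the return difference to be the product of the estimated volatility and an independent Gaussian increment of variance `dt`, as the caption of Fig. 8 suggests. The function name `artificial_returns`, the parameter `dt` and the log-normal toy volatility are hypothetical choices for this example.

```python
import numpy as np


def artificial_returns(sigma_est, dt=1.0, seed=None):
    """Multiply each estimated volatility by an independent Gaussian increment."""
    rng = np.random.default_rng(seed)
    sigma_est = np.asarray(sigma_est, dtype=float)
    # Gaussian increments with zero mean and variance dt (assumed Wiener increments)
    dw = rng.normal(0.0, np.sqrt(dt), size=sigma_est.shape)
    return sigma_est * dw


# Toy illustration: a log-normally distributed volatility (hypothetical input)
rng_demo = np.random.default_rng(2)
sigma_demo = 0.01 * np.exp(0.5 * rng_demo.standard_normal(100_000))
dx_demo = artificial_returns(sigma_demo, seed=3)

# Kurtosis above 3 signals tails fatter than those of a Gaussian
kurt = np.mean(dx_demo**4) / np.mean(dx_demo**2) ** 2
print(f"kurtosis of the artificial returns: {kurt:.2f}")
```

Even though each increment is Gaussian, the mixture over a fluctuating volatility yields a fat-tailed distribution, whose width scales with the typical volatility of the input series.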

In order to study the extreme values of the indexes, we have calculated the MFPT for the
absolute value of returns $|\Delta X|$ calculated using the estimated volatility. In Fig. 12 we
have plotted the evolution of this MFPT when the model used is the expOU. We observe a
clear coincidence of all the indexes except the Dow Jones, which has a slightly smaller MFPT
and is incidentally the market from which the parameters are extracted. If we look at
the OU case shown in Fig. 13, it is the IBEX stock index that shows a different behavior,
especially in the range of small thresholds L. This can be justified by the fact that the OU
model allows for negative values of volatility while the ML method considers only positive
values, and the results for small L are the ones most sensitive to this
fact. Additionally, the IBEX market is the one with the smallest amount of data available. The
Heston case shown in Fig. 14 recovers the nice collapse provided by the expOU model, where
the DJI again appears slightly shifted. In any case, with the single exception of the IBEX
with the OU model, a common pattern is observed.

FIG. 10: Probability distribution of the estimated volatilities (19) of the Standard and Poor's-500
(S&P) and the IBEX-35. We plot our estimated volatility for the Heston model jointly with the
deconvoluted volatilities provided by Eq. Q.

FIG. 11: Comparison between the probability density of the return differences $\Delta X$ artificially
computed by using Eq. ^ and by taking $\sigma_{\rm est}$ (19). The expOU model has been used to
calculate the estimated volatility. All markets look similar.

FIG. 12: MFPT of the absolute value of return differences, calculated using the artificial absolute
return differences obtained with the estimated volatility $\sigma_{\rm est}$ (cf. Eqs. (|3j) and (19)). The estimated
volatility has been computed using the expOU model.

FIG. 13: MFPT of the absolute value of return differences, calculated using the artificial absolute
return differences obtained with the estimated volatility $\sigma_{\rm est}$ (cf. Eqs. ^ and (19)). The estimated
volatility has been computed using the OU model.
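For readers who wish to reproduce this kind of analysis, a simple empirical MFPT estimator can be sketched as below. It is one possible estimator and not necessarily the one used for Figs. 12-14: for every starting time it counts the number of steps until the series first reaches or exceeds the threshold L, and then averages over all starting times from which a crossing is observed. The function name `mean_first_passage_time` and the parameter `dt` are assumptions of this example.

```python
import numpy as np


def mean_first_passage_time(abs_returns, level, dt=1.0):
    """Average waiting time until the series first reaches `level`.

    Starting times after the last crossing are discarded because their
    first-passage time is not observed in the sample.
    """
    x = np.asarray(abs_returns, dtype=float)
    hits = np.flatnonzero(x >= level)  # times at which the level is reached
    if hits.size == 0:
        return np.nan  # the level is never reached in this sample
    starts = np.arange(x.size)
    # Position in `hits` of the first crossing strictly after each start
    pos = np.searchsorted(hits, starts, side="right")
    valid = pos < hits.size
    waits = hits[pos[valid]] - starts[valid]
    return dt * waits.mean()


# Toy illustration with Gaussian absolute returns (synthetic, not market data)
rng = np.random.default_rng(1)
toy = np.abs(rng.standard_normal(100_000))
for level in (0.5, 1.0, 2.0):
    print(level, mean_first_passage_time(toy, level))
```

With this estimator the MFPT grows with the threshold, so curves from different indexes are best compared after expressing L in units of each index's typical return amplitude.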



FIG. 14: MFPT of the absolute value of return differences, calculated using the artificial absolute
return differences obtained with the estimated volatility $\sigma_{\rm est}$ (cf. Eqs. (|3j) and (19)). The estimated
volatility has been computed using the Heston model.

Finally, we show in Fig. 15 that some indexes manifest more leverage than
others. As an example, the S&P has a stronger anticorrelation than the Dow Jones. However,
the important fact is that we find leverage in all markets. The same happens with the
volatility autocorrelation: although that of the NASDAQ decays more slowly, all the indexes
manifest significant autocorrelation for hundreds of days, as expected [2]. The same results are
found when we take the OU and expOU models instead of the Heston one.
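Both correlations discussed here can be estimated from a return series and a volatility series with a few lines of code. The sketch below assumes a commonly used form of the leverage correlation, namely the average of the return at time t times the squared volatility at time t + lag, normalized by the squared mean of the squared volatility; whether this coincides exactly with Eq. (26) is an assumption. The function names and the synthetic test series are hypothetical.

```python
import numpy as np


def leverage_correlation(dx, sigma, lag):
    """<dx(t) sigma(t+lag)^2> / <sigma^2>^2 (assumed form of the leverage)."""
    dx = np.asarray(dx, dtype=float)
    s2 = np.asarray(sigma, dtype=float) ** 2
    n = dx.size
    if abs(lag) >= n:
        raise ValueError("lag must be smaller than the series length")
    if lag >= 0:
        num = np.mean(dx[: n - lag] * s2[lag:])
    else:
        num = np.mean(dx[-lag:] * s2[: n + lag])
    return num / np.mean(s2) ** 2


def autocorrelation(series, lag):
    """Normalized autocorrelation of a series at a non-negative lag."""
    x = np.asarray(series, dtype=float)
    x = x - x.mean()
    return np.mean(x[: x.size - lag] * x[lag:]) / np.mean(x * x)


# Synthetic check: volatility rises after negative returns (toy leverage effect)
rng = np.random.default_rng(4)
dx = rng.standard_normal(50_000)
sigma = np.ones(dx.size)
sigma[1:] = np.exp(-0.25 * dx[:-1])
print(leverage_correlation(dx, sigma, 1), leverage_correlation(dx, sigma, -1))
```

In the toy series the correlation is negative at lag 1 and compatible with zero at lag -1, which is the asymmetry that characterizes the leverage effect.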



V. CONCLUSIONS 



It is well known that volatility is one of the main quantities in finance because it
is a measure of price fluctuations and it gives information about the risk of holding
an asset. However, volatility is not directly observable, and one therefore
needs to assume a given market model in order to infer its value. Basic volatility
estimation procedures have been presented, and we have used a ML method that improves on
them since it is able to reduce noise and avoid bias in the volatility signal.

FIG. 15: Comparison between the leverage effect of the Dow Jones Industrial Average (DJ),
Standard and Poor's-500 (S&P), and NASDAQ. The inset shows their volatility autocorrelation.
The Heston model has been used to calculate the estimated volatility and the corresponding
artificial return time series of each index. The OU and expOU models and the rest of the market
indexes studied show identical results.

We have applied the ML method by considering the most basic versions of the expOU,
the OU and the Heston uncorrelated SV models, and we have compared them with the
deconvoluted volatility, showing a big improvement in many aspects. We have observed that the
fluctuations of the estimated volatility are smaller in all the models than in the deconvoluted
estimation. The three models preserve the desired stationary pdf for the volatility
and keep the fat-tailed distribution for the price return changes. We have also found that all
three models allow us to forecast the future absolute value of returns from current volatilities.
We have also observed that the loss of forecast information has a double time scale in the
expOU and the OU models.

Concerning the study of extreme events, we have found that our ML approach shows a
nice agreement between the volatility MFPT estimated with the three SV models and the
empirical MFPT. We have also focused on the volatility's time correlations and we have observed
that all three models show significant volatility autocorrelation for
hundreds of days, although the Heston and OU models do not include this property beforehand.
The leverage correlation between volatility and price return fluctuations is also nicely
described by all three models, even though the ML method does not consider any correlation
between returns and volatility fluctuations beforehand. All of this confirms that the
methodology is robust enough without needing to improve the SV models or to provide
more efficient ways of estimating the parameters of the model. However, the ML approach with
alternative models of the same level of sophistication, like the recent model by Delpini and
Bormetti [12], deserves attention in future research.

Finally, we have applied the same method to other stock indexes. The volatility's noise has been
strongly reduced in all cases, and we have corroborated that all the markets exhibit the
properties described before for the Dow Jones. The methodology therefore seems to
be valid for a wide collection of financial market data.